Path finding strategies in scale-free networks 
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We numerically investigate the scale-free network model of Barabasi and Albert [A. L. Barabasi 
and R. Albert, Science 286, 509 (1999)] through the use of various path finding strategies. In real 
networks, global network information is not accessible to each vertex, and the actual path connecting 
two vertices can sometimes be much longer than the shortest one. A generalized diameter depending 
on the actual path finding strategy is introduced, and a simple strategy, which utilizes only local 
information on the connectivity, is suggested and shown to yield small-world behavior; the diameter 
D of the network increases logarithmically with the network size A'^, the same as is found with global 
strategy. If paths are sought at random, D ~ A''"'^ is found. 

PACS numbers: 89.75.Hc, 64.60.Fr, 84.35. +i, 87.23. Ge 
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Complex network systems are abundant in many disci- 
plines of sciences: social networks composed of individu- 
als interacting through social connections, the world-wide 
web where vertices are documents and edges are hyper- 
links to other documents, genetic networks with genes 
or proteins connected by chemical interactions, and eco- 
nomic networks, and so on [0. Topological information 
on such complex networks is becoming more and more 
available through computer aided techniques, and there 
is growing interest in the investigation of properties of 
such networks. 

In complex networks, two remote vertices (or nodes) 
can be connected by many paths. Among those paths 
the length of the shortest one usually defines the "dis- 
tance" ds{i,j) between the two vertices and the 
diameter Dg of the network is defined to be either the 
maximum distance Dg = maxdsii, j), or the average dis- 
tance Dg = {ds{i,j)). We note that finding the shortest 
path between two vertices requires global information on 
how all vertices are interconnected, which is usually not 
accessible to vertices that constitute the network. From 
this, we suggest that the actual path finding strategy 
should be based only on local information and should 
introduce the generalized distance d(i,j) and diameter 
D = {d{i,j)), which depend on the actual path finding 
strategy. In Ref. a local path finding strategy based 
on geometric information was suggested and the delivery 
time in Ref. corresponds to our generalized distance 
d{i,j). 

A recent study of the scale-free networks has revealed 
that the small-world phenomenon (that two distinct ver- 
tices can be connected by a remarkably small number of 
intervening vertices) emerges, i.e., Dg ~ log^o ^ with the 
network size However, only the shortest paths have 

been examined and the scaling behavior can in principle 



be very different if a more realistic path finding strategy 
is used. In the experiment performed by Milgram |Q, ^ 
the person in Nebraska could not know what the short- 
est connection was to send a message to the final re- 
cipient in Boston, but only tried to deliver it to his/her 
directly connected (on a first-name basis) friend who was 
supposed to be closer to the person targeted in Boston. 
Also, for the World-Wide Web (WWW), although two 
documents are just a few clicks away if the shortest path 
is traced it docs not necessarily guarantee that one 
can get the information easily; sometimes it is possible 
to go around long distances to access a document which 
turns out to be only a few clicks away from the starting 
document. Consequently, the small- world phenomenon 
observed in scale-free networks through the use of the 
global path finding strategy still remains to be confirmed 
by use of a local strategy. In this work we suggest sev- 
eral path finding strategies and show that there indeed 
exists a very simple local strategy which results in the 
small-world phenomenon. 

We first construct the scale- free networks following the 
same method used in Ref. by Barabasi and Albert 
(BA): Starting with a small number (toq) of vertices (or 
nodes), a new vertex with m edges (or links) is added at 
each time step in such a way that the probability H^ of 
being connected to the existing vertex i is proportional to 
the connectivity ki (the number of vertices directly con- 
nected to i) of that vertex, i.e., H^ — {hi + 1)/ + 1) 
with summation over the whole network at a given in- 
stant In Ref. ||], it has been shown that the above 
method of constructing networks, which is composed of 
ideas of growth and preferential attachment, results in 
the so-called scale-free networks, which shows the power- 
law behavior in the connectivity distribution. Once the 
BA model network is constructed, we seek a path that 
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FIG. 1: Comparison of (a) SHT global strategy and (b) lo- 
cal MAX strategy in finding a path connecting vertices p and 
q (see the text for details), (a) For SHT, the path-finding 
algorithm is similar to pouring water into starting vertex p: 
water fills vertices with distance d -\- 1 one step after it fills 
vertices with d. (b) For MAX, one proceeds vertex by ver- 
tex by choosing the one with the largest connectivity among 
directly connected vertices. 



connects two vertices in the network. At this stage, it 
is possible to apply various strategies; for example, in 
previous studies |^ , the shortest paths have been consid- 
ered, the searching of which requires global information 
on the interconnections. In real network systems like the 
WWW, the social network, and epidemic spread, it is 
very unlikely that each vertex has global information and 
thus one expects that each vertex has enough information 
only for its directly connected vertices. In other words, 
the strategy used when finding paths that connect two 
vertices should be based only on local information not on 
global. 

In this work we suggest three local strategies that one 
can use to find a path that connects two vertices: (i) the 
vertex with the largest connectivity is tried first (maxi- 
mum connectivity first, MAX for short), (ii) a vertex is 
chosen at random (random choice, RND), and (iii) the 
vertex with the larger connectivity has the higher prob- 
ability to be chosen (preferential choice, PRF). We also 
compare the results with the previously studied global 
strategy which seeks the shortest path (SHT). In the 
above experiment by Milgram, for example, the simplest 
and most intuitive (and quite efficient indeed) strategy 
one can use is to first ask the friend who has more friends 
than others, which corresponds to the MAX strategy in 
this work. In reality, it is possible that one does not know 
who has more friends than others, but even in this case 
(s)he can try to first ask the friend who seems to have 
more friends than others; this then has a close resem- 
blance to PRF strategy. The most naive strategy (it is 
not a strategy as a matter of fact) will be asking anyone 
(s)he sees first, which corresponds to RND strategy. 

Figures 0(a) and|l|(b), respectively, show a comparison 
between SHT global strategy and local MAX strategy 
when the path from vertex p to vertex q is sought. In 
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FIG. 2: Diameter D of the scale-free network vs size A'^. 

(a) The shortest path (SHT) strategy, which requires global 
information on interconnections, results in D ~ loSio ^ @ • 

(b) Local MAX strategy also gives rise to the same behavior 
as global SHT strategy, D ~ logjQ A'', although the D is bigger 
than that of SHT. (c), (d) Local RND and PRF strategies are 
better described by the power-law behavior D ~ A"'^^ and 
~ A" "^*, respectively. See the text for a description of each 
strategy. 



SHT, we give the distance as d = to vertex p [the first 
diagram in Fig. |^(a)], and d = 1 is given to all vertices 
which are directly connected to p [the second diagram in 
Fig. [^(a)]. Then the strategy proceeds to fill all vertices 
with d ~ 2 from vertices with d = 1, and then d = 3, 
and so on, until d values are given to all vertices in the 
network. According to this SHT strategy the d value at- 
tached to vertex q is simply its distance from p: In the 
example shown in Fig. ^a), one has d = 3 for the path 
connecting p and g, denoted by the thick lines in the last 
diagram in Fig. |l|(a). It is clear that SHT determines 
the shortest path lengths but by using global informa- 
tion. Figure |^(b) explains local MAX strategy: one starts 
at vertex p and looks around its directly connected ver- 
tices to find which vertex has the largest connectivity. In 
Fig. |l|(b) , four vertices directly connected to p have con- 
nectivities of /c = 1, 2, 2, and 3, respectively, and thus we 
proceed to the vertex with fc = 3 as shown in the second 
diagram. This strategy proceeds until one arrives at the 
vertex that is directly connected to q. In the example in 
Fig. |l|(b) for local MAX strategy, one has the path de- 
noted by the thick lines in the last diagram of Fig. |l|(b) 
with d = 3. The PRF strategy tries to proceed to the ver- 
tex with probability proportional to the connectivity, i.e., 
P — ki/ kj with summation over directly connected 
vertices of i. In the above example in Fig. |l](b), PRF 
connects to the vertex with fc = 3 [see the first diagram 
in Fig. g(b)] with probability P = 3/(H-2-f2-h3) = 3/8. 
Of course, RND connects to a vertex at random at every 
stage in path finding. 

The BA model network in this work is constructed with 
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mo = m = 2 and the paths are found according to the 
above mentioned various strategies. If the path contains 
self-crossing loops, we remove them when distance d is 
computed. Figures ||(a)-||(b) display the scahng behav- 
ior of the generalized diameter of the scale-free network 
for the strategies SHT, MAX, RND, and PRF, respec- 
tively. To get better statistics, the diameters were ob- 
tained from averages over many different network real- 
izations. It is clearly shown that although the diameter 
determined from local MAX strategy is a little bigger 
than that from global SHT strategy and both exhibit 
the same small- world phenomenon, i.e., D ^ log^Q N 
with network size N. On the other hand, RND strat- 
egy shows the power-law behavior D ~ N^-^ Finally, 
PRF strategy shows the scaling behavior, which fits well 
to power-law D ~ A^O-^^ [ggg pig_ ^(d)]. As expected, the 
scaling behavior of the generalized diameter of the net- 
work depends not only on the structure of the networks 
but also on the strategy used in path finding. It should 
also be noted that a very simple MAX strategy, based on 
local information instead of global, can result in the same 
small-world effect as global SHT strategy. On the other 
hand, we are puzzled by the fact that PRF and MAX 
strategies show very different scaling behaviors although 
both look quite similar. To compare both strategies in 
a more careful way, one can change the probability used 
by PRF, e.g., P oc fc" with a > 1, to suppress the prob- 
ability of the vertex with the lower connectivity being 
chosen. 

For comparison, we also study the scaling behavior of 
the diameter of the small-world network model by Watts 
and Strogatz (WS) In the WS model, each vertex 
is almost equal to each other and there does not exist a 
governing dominant vertex with very large connectivity. 
This feature makes our local MAX and PRF strategies 
inefficient and from numerical investigations we find that 
none of the local strategies tried in this work yields the 
small-world phenomenon but gives D ~ 7V° with a ~ 0.6 
unanimously (we have used the range of local connections 
K = 3 and the rewiring probability P — 0.1; see Watts 
Strogatz 's work in Ref. 0] for details of the model). This 
is in a sharp contrast to the result from the SHT, which 
reveals D ^ log^o N. From the above observation, we 
conclude that the success of strategy MAX for the BA 
model is due to the existence of the highly connected ver- 
tices, thus reflecting the scale-free nature of the network. 
It is of note that in Ref. an efficient local path finding 
strategy based on geometric informations was suggested 
for a two-dimensional small- world network model: When 
the path from vertex A to target B is found, the strategy 
first connects C which is closest to B in a geometric sense 
among the A's directly connected vertices. We believe 
that our local strategies in this work have some advan- 
tage when more abstract networks like the Internet and 
WWW are involved: In such networks vertices do not 
have coordinates and path finding strategies cannot use 
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FIG. 3: Ratio r of failed connections vs the vertex fail- 
ure fraction /[ = (number of vertices removed by errors or 
attacks)/(number of vertices)]. The error tolerance and the 
attack vulnerability of the scale-free networks do not depend 
on the path-finding strategy. The network size, N = 1000, 
and more than 100 different network realizations were used 
to compute average values for each strategy. 
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FIG. 4: Error tolerance and attack vulnerability reflected 
in the diameter of the network for strategies (a) SHT, (b) 
MAX, (c) RND, and (d) PRF (see the text for details). AU 
strategies show similar behavior as implied by Fig. ^ except 
that C decreases rather than increases for RND in the case of 
failures due to errors. A'' = 1000 and more than 100 different 
network realizations were used. 



geometric information. 

We finally examine the error and attack tolerance of 
the BA model (see Ref. Q for a study based on SHT 
strategy) that is subjected to the various strategies. We 
first remove N' vertices in the network either randomly 
(error) or starting from the vertex with the highest con- 
nectivity (attack) and then measure how many pairs 
of vertices are connected by a given path finding strat- 
egy. More specifically, we pick two vertices, p and (7, and 
examine if q can be connected to p by the path finding 
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strategy given. Since there are a total of N{N — 1) possi- 
ble choices for the pair {p, q) in a network of size N, the 
ratio r of failed connections in Fig. || is then defined as 



_ (number of failed connections) 
" N{N - 1) ■ 



(1) 



The tolerance in this article is detected either by r in 
Eq. (P or by the change in diameter d of the network. 
The former should not depend on the strategy since the 
strategy chosen can only change the path lengths be- 
tween vertices once they belong to the same cluster. As 
expected, r versus the failure fraction / = N' /N (the 
ratio between the number of vertices removed and the 
total number of vertices in the network) in Fig. |^ shows 
the behavior independent of the strategy. This robust- 
ness to the path finding strategy implies that the error 
tolerance and the attack vulnerability found in Ref. ||] 
are genuine topological characteristics of a scale-free net- 
work. Figures ^a)-|^(d) confirm the similar tolerance 
behavior reflected in diameter d to errors and attacks for 
various strategies: SHT, MAX, RND, and PRF, respec- 
tively. For RND strategy in Fig. ^(c), it can be seen that 
removal of randomly chosen vertices makes d smaller as 
/ is increased from zero; this somewhat interesting be- 
havior is due to the fact that in RND path finding can be 
more efficient if unimportant connections are removed. 

In summary, we have investigated the possibility of the 
small-world phenomenon in the scale-free network sub- 
jected to several local strategies of path finding. It was 
found that there is a very simple local strategy based on 
local connectivity information which leads to the scal- 
ing behavior D ^ logj^p The error tolerance and the 
attack vulnerability found previously by global strategy 
have been shown to be generic topological properties of 
scale-free networks, and do not depend on the path find- 
ing strategy. 

S.K.H. and B.J.K. acknowledge support from the BK 
21 project of the Ministry of Education in Korea. This 
work was supported in part by the Swedish Natural Re- 
search Council through Contract No. F5102-659/2001 
(B.J.K.), and by the NSF through Grant No. PHY- 



9988674 and CAREER DMR97-01998 (H.J.). 

Note Added in Proof: After submission of this paper we 
found that the similar local search algorithm in networks 
with power-law connectivity distribution was studied in 
Ref. PI. 
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